Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Fix conda on windows #2676

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open

[CI] Fix conda on windows #2676

wants to merge 2 commits into from

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Dec 20, 2024

Description

Describe your changes in detail.

Motivation and Context

Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax close #15213 if this solves the issue #15213

  • I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds core functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)
  • Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • I have read the CONTRIBUTION guide (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.

Copy link

pytorch-bot bot commented Dec 20, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2676

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (8 Unrelated Failures)

As of commit 2294f37 with merge base ab4250e (image):

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 20, 2024
Copy link

github-actions bot commented Dec 20, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}6$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4388s 0.4369s 2.2890 Ops/s 2.2094 Ops/s $\color{#35bf28}+3.60\%$
test_transformed 0.6266s 0.6215s 1.6090 Ops/s 1.6243 Ops/s $\color{#d91a1a}-0.94\%$
test_serial 1.3827s 1.3808s 0.7242 Ops/s 0.7142 Ops/s $\color{#35bf28}+1.41\%$
test_parallel 1.3107s 1.2241s 0.8170 Ops/s 0.8125 Ops/s $\color{#35bf28}+0.54\%$
test_step_mdp_speed[True-True-True-True-True] 0.2451ms 31.3934μs 31.8538 KOps/s 31.8035 KOps/s $\color{#35bf28}+0.16\%$
test_step_mdp_speed[True-True-True-True-False] 57.9280μs 18.8250μs 53.1208 KOps/s 54.4507 KOps/s $\color{#d91a1a}-2.44\%$
test_step_mdp_speed[True-True-True-False-True] 51.0350μs 17.8027μs 56.1713 KOps/s 56.8712 KOps/s $\color{#d91a1a}-1.23\%$
test_step_mdp_speed[True-True-True-False-False] 46.2160μs 10.4053μs 96.1049 KOps/s 96.5390 KOps/s $\color{#d91a1a}-0.45\%$
test_step_mdp_speed[True-True-False-True-True] 82.6450μs 33.9015μs 29.4973 KOps/s 29.8929 KOps/s $\color{#d91a1a}-1.32\%$
test_step_mdp_speed[True-True-False-True-False] 53.5400μs 20.6609μs 48.4005 KOps/s 49.6081 KOps/s $\color{#d91a1a}-2.43\%$
test_step_mdp_speed[True-True-False-False-True] 47.3280μs 19.8757μs 50.3126 KOps/s 51.3995 KOps/s $\color{#d91a1a}-2.11\%$
test_step_mdp_speed[True-True-False-False-False] 47.3280μs 12.4805μs 80.1248 KOps/s 82.0350 KOps/s $\color{#d91a1a}-2.33\%$
test_step_mdp_speed[True-False-True-True-True] 70.1610μs 35.8730μs 27.8761 KOps/s 28.3654 KOps/s $\color{#d91a1a}-1.73\%$
test_step_mdp_speed[True-False-True-True-False] 0.1000ms 22.4825μs 44.4789 KOps/s 44.3679 KOps/s $\color{#35bf28}+0.25\%$
test_step_mdp_speed[True-False-True-False-True] 49.6130μs 19.7783μs 50.5605 KOps/s 51.5560 KOps/s $\color{#d91a1a}-1.93\%$
test_step_mdp_speed[True-False-True-False-False] 48.1300μs 12.4884μs 80.0743 KOps/s 82.5616 KOps/s $\color{#d91a1a}-3.01\%$
test_step_mdp_speed[True-False-False-True-True] 0.1184ms 37.8705μs 26.4058 KOps/s 26.7416 KOps/s $\color{#d91a1a}-1.26\%$
test_step_mdp_speed[True-False-False-True-False] 58.5090μs 24.4727μs 40.8618 KOps/s 41.7869 KOps/s $\color{#d91a1a}-2.21\%$
test_step_mdp_speed[True-False-False-False-True] 0.1060ms 21.4102μs 46.7068 KOps/s 46.3940 KOps/s $\color{#35bf28}+0.67\%$
test_step_mdp_speed[True-False-False-False-False] 38.2410μs 14.3619μs 69.6287 KOps/s 71.5842 KOps/s $\color{#d91a1a}-2.73\%$
test_step_mdp_speed[False-True-True-True-True] 86.5020μs 35.8303μs 27.9094 KOps/s 28.3263 KOps/s $\color{#d91a1a}-1.47\%$
test_step_mdp_speed[False-True-True-True-False] 49.5520μs 22.4787μs 44.4866 KOps/s 45.1280 KOps/s $\color{#d91a1a}-1.42\%$
test_step_mdp_speed[False-True-True-False-True] 0.1046ms 23.8065μs 42.0054 KOps/s 44.5382 KOps/s $\textbf{\color{#d91a1a}-5.69\%}$
test_step_mdp_speed[False-True-True-False-False] 84.4480μs 13.6455μs 73.2844 KOps/s 72.6821 KOps/s $\color{#35bf28}+0.83\%$
test_step_mdp_speed[False-True-False-True-True] 73.0370μs 37.6069μs 26.5909 KOps/s 27.0140 KOps/s $\color{#d91a1a}-1.57\%$
test_step_mdp_speed[False-True-False-True-False] 67.4560μs 24.6290μs 40.6026 KOps/s 41.7732 KOps/s $\color{#d91a1a}-2.80\%$
test_step_mdp_speed[False-True-False-False-True] 2.4400ms 24.9498μs 40.0805 KOps/s 42.1045 KOps/s $\color{#d91a1a}-4.81\%$
test_step_mdp_speed[False-True-False-False-False] 47.7990μs 15.7352μs 63.5519 KOps/s 64.7795 KOps/s $\color{#d91a1a}-1.90\%$
test_step_mdp_speed[False-False-True-True-True] 78.8670μs 39.9006μs 25.0623 KOps/s 25.6387 KOps/s $\color{#d91a1a}-2.25\%$
test_step_mdp_speed[False-False-True-True-False] 77.4520μs 26.5052μs 37.7285 KOps/s 38.0591 KOps/s $\color{#d91a1a}-0.87\%$
test_step_mdp_speed[False-False-True-False-True] 82.9250μs 24.6295μs 40.6016 KOps/s 41.1554 KOps/s $\color{#d91a1a}-1.35\%$
test_step_mdp_speed[False-False-True-False-False] 68.8190μs 15.6817μs 63.7686 KOps/s 64.5198 KOps/s $\color{#d91a1a}-1.16\%$
test_step_mdp_speed[False-False-False-True-True] 94.6560μs 41.7888μs 23.9298 KOps/s 24.7467 KOps/s $\color{#d91a1a}-3.30\%$
test_step_mdp_speed[False-False-False-True-False] 80.4500μs 28.2306μs 35.4225 KOps/s 36.1027 KOps/s $\color{#d91a1a}-1.88\%$
test_step_mdp_speed[False-False-False-False-True] 86.5010μs 26.1985μs 38.1701 KOps/s 38.4990 KOps/s $\color{#d91a1a}-0.85\%$
test_step_mdp_speed[False-False-False-False-False] 43.5920μs 17.5514μs 56.9754 KOps/s 57.8803 KOps/s $\color{#d91a1a}-1.56\%$
test_values[generalized_advantage_estimate-True-True] 10.4387ms 9.8316ms 101.7133 Ops/s 102.0210 Ops/s $\color{#d91a1a}-0.30\%$
test_values[vec_generalized_advantage_estimate-True-True] 35.5951ms 33.3390ms 29.9949 Ops/s 29.9517 Ops/s $\color{#35bf28}+0.14\%$
test_values[td0_return_estimate-False-False] 0.2569ms 0.1814ms 5.5140 KOps/s 5.6655 KOps/s $\color{#d91a1a}-2.67\%$
test_values[td1_return_estimate-False-False] 25.1609ms 24.4281ms 40.9364 Ops/s 40.8463 Ops/s $\color{#35bf28}+0.22\%$
test_values[vec_td1_return_estimate-False-False] 34.7931ms 33.4018ms 29.9385 Ops/s 29.7946 Ops/s $\color{#35bf28}+0.48\%$
test_values[td_lambda_return_estimate-True-False] 36.4398ms 34.6426ms 28.8662 Ops/s 28.1266 Ops/s $\color{#35bf28}+2.63\%$
test_values[vec_td_lambda_return_estimate-True-False] 35.4715ms 33.4248ms 29.9179 Ops/s 29.8356 Ops/s $\color{#35bf28}+0.28\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 12.5428ms 8.4508ms 118.3325 Ops/s 116.8323 Ops/s $\color{#35bf28}+1.28\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.4433ms 1.8586ms 538.0380 Ops/s 522.8018 Ops/s $\color{#35bf28}+2.91\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.4579ms 0.3600ms 2.7777 KOps/s 2.7215 KOps/s $\color{#35bf28}+2.07\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 38.8150ms 36.0389ms 27.7478 Ops/s 27.5736 Ops/s $\color{#35bf28}+0.63\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.0142ms 3.0479ms 328.0978 Ops/s 327.9643 Ops/s $\color{#35bf28}+0.04\%$
test_dqn_speed[False-None] 6.0853ms 1.4234ms 702.5311 Ops/s 711.0123 Ops/s $\color{#d91a1a}-1.19\%$
test_dqn_speed[False-backward] 2.0839ms 1.9125ms 522.8775 Ops/s 523.2188 Ops/s $\color{#d91a1a}-0.07\%$
test_dqn_speed[True-None] 0.7679ms 0.4870ms 2.0532 KOps/s 2.0416 KOps/s $\color{#35bf28}+0.57\%$
test_dqn_speed[True-backward] 0.9643ms 0.9045ms 1.1056 KOps/s 1.0395 KOps/s $\textbf{\color{#35bf28}+6.36\%}$
test_dqn_speed[reduce-overhead-None] 0.6580ms 0.4851ms 2.0615 KOps/s 2.0617 KOps/s $\color{#d91a1a}-0.01\%$
test_dqn_speed[reduce-overhead-backward] 1.0080ms 0.9144ms 1.0936 KOps/s 1.0837 KOps/s $\color{#35bf28}+0.91\%$
test_ddpg_speed[False-None] 3.5586ms 2.9382ms 340.3413 Ops/s 345.4201 Ops/s $\color{#d91a1a}-1.47\%$
test_ddpg_speed[False-backward] 4.1837ms 4.0534ms 246.7045 Ops/s 245.2347 Ops/s $\color{#35bf28}+0.60\%$
test_ddpg_speed[True-None] 1.6705ms 1.0195ms 980.8902 Ops/s 962.2352 Ops/s $\color{#35bf28}+1.94\%$
test_ddpg_speed[True-backward] 2.1713ms 1.9718ms 507.1416 Ops/s 499.3141 Ops/s $\color{#35bf28}+1.57\%$
test_ddpg_speed[reduce-overhead-None] 1.4291ms 1.0157ms 984.5199 Ops/s 966.6671 Ops/s $\color{#35bf28}+1.85\%$
test_ddpg_speed[reduce-overhead-backward] 1.9826ms 1.9073ms 524.3067 Ops/s 514.2436 Ops/s $\color{#35bf28}+1.96\%$
test_sac_speed[False-None] 8.9895ms 8.2466ms 121.2626 Ops/s 121.3531 Ops/s $\color{#d91a1a}-0.07\%$
test_sac_speed[False-backward] 12.9004ms 11.3198ms 88.3405 Ops/s 90.4314 Ops/s $\color{#d91a1a}-2.31\%$
test_sac_speed[True-None] 2.3135ms 1.8480ms 541.1396 Ops/s 518.0311 Ops/s $\color{#35bf28}+4.46\%$
test_sac_speed[True-backward] 3.9674ms 3.6115ms 276.8941 Ops/s 273.2113 Ops/s $\color{#35bf28}+1.35\%$
test_sac_speed[reduce-overhead-None] 2.2225ms 1.8680ms 535.3302 Ops/s 525.3523 Ops/s $\color{#35bf28}+1.90\%$
test_sac_speed[reduce-overhead-backward] 4.1831ms 3.6240ms 275.9387 Ops/s 270.8402 Ops/s $\color{#35bf28}+1.88\%$
test_redq_speed[False-None] 15.1543ms 13.3753ms 74.7648 Ops/s 74.2613 Ops/s $\color{#35bf28}+0.68\%$
test_redq_speed[False-backward] 30.7728ms 23.1507ms 43.1952 Ops/s 43.8834 Ops/s $\color{#d91a1a}-1.57\%$
test_redq_speed[True-None] 5.5471ms 4.9603ms 201.5987 Ops/s 198.9715 Ops/s $\color{#35bf28}+1.32\%$
test_redq_speed[True-backward] 13.9099ms 12.8519ms 77.8098 Ops/s 79.6227 Ops/s $\color{#d91a1a}-2.28\%$
test_redq_speed[reduce-overhead-None] 5.7268ms 4.9824ms 200.7048 Ops/s 191.6927 Ops/s $\color{#35bf28}+4.70\%$
test_redq_speed[reduce-overhead-backward] 13.8402ms 12.5355ms 79.7731 Ops/s 78.8948 Ops/s $\color{#35bf28}+1.11\%$
test_redq_deprec_speed[False-None] 15.0471ms 13.4564ms 74.3140 Ops/s 74.0226 Ops/s $\color{#35bf28}+0.39\%$
test_redq_deprec_speed[False-backward] 21.1397ms 19.2845ms 51.8551 Ops/s 51.1352 Ops/s $\color{#35bf28}+1.41\%$
test_redq_deprec_speed[True-None] 4.1885ms 3.7196ms 268.8484 Ops/s 272.1516 Ops/s $\color{#d91a1a}-1.21\%$
test_redq_deprec_speed[True-backward] 9.6114ms 9.0121ms 110.9625 Ops/s 118.1628 Ops/s $\textbf{\color{#d91a1a}-6.09\%}$
test_redq_deprec_speed[reduce-overhead-None] 4.7523ms 3.7362ms 267.6543 Ops/s 275.6400 Ops/s $\color{#d91a1a}-2.90\%$
test_redq_deprec_speed[reduce-overhead-backward] 10.8522ms 8.8029ms 113.5987 Ops/s 114.5626 Ops/s $\color{#d91a1a}-0.84\%$
test_td3_speed[False-None] 9.3362ms 8.3186ms 120.2122 Ops/s 121.7942 Ops/s $\color{#d91a1a}-1.30\%$
test_td3_speed[False-backward] 16.6742ms 10.8565ms 92.1107 Ops/s 93.7609 Ops/s $\color{#d91a1a}-1.76\%$
test_td3_speed[True-None] 2.0212ms 1.7530ms 570.4391 Ops/s 567.5036 Ops/s $\color{#35bf28}+0.52\%$
test_td3_speed[True-backward] 3.9064ms 3.5566ms 281.1642 Ops/s 283.6743 Ops/s $\color{#d91a1a}-0.88\%$
test_td3_speed[reduce-overhead-None] 1.9484ms 1.7695ms 565.1272 Ops/s 566.3869 Ops/s $\color{#d91a1a}-0.22\%$
test_td3_speed[reduce-overhead-backward] 3.7575ms 3.4584ms 289.1545 Ops/s 292.7338 Ops/s $\color{#d91a1a}-1.22\%$
test_cql_speed[False-None] 40.1932ms 37.8972ms 26.3872 Ops/s 26.4975 Ops/s $\color{#d91a1a}-0.42\%$
test_cql_speed[False-backward] 57.3964ms 49.8463ms 20.0617 Ops/s 20.6715 Ops/s $\color{#d91a1a}-2.95\%$
test_cql_speed[True-None] 18.6883ms 16.3291ms 61.2405 Ops/s 63.2038 Ops/s $\color{#d91a1a}-3.11\%$
test_cql_speed[True-backward] 24.5472ms 23.1788ms 43.1429 Ops/s 42.4550 Ops/s $\color{#35bf28}+1.62\%$
test_cql_speed[reduce-overhead-None] 17.7356ms 15.9971ms 62.5111 Ops/s 61.4704 Ops/s $\color{#35bf28}+1.69\%$
test_cql_speed[reduce-overhead-backward] 26.8139ms 23.4953ms 42.5616 Ops/s 43.4608 Ops/s $\color{#d91a1a}-2.07\%$
test_a2c_speed[False-None] 9.6490ms 7.6403ms 130.8855 Ops/s 135.0873 Ops/s $\color{#d91a1a}-3.11\%$
test_a2c_speed[False-backward] 17.7123ms 15.2319ms 65.6519 Ops/s 67.6749 Ops/s $\color{#d91a1a}-2.99\%$
test_a2c_speed[True-None] 4.8913ms 4.3656ms 229.0649 Ops/s 232.8897 Ops/s $\color{#d91a1a}-1.64\%$
test_a2c_speed[True-backward] 12.7885ms 11.7321ms 85.2365 Ops/s 90.6268 Ops/s $\textbf{\color{#d91a1a}-5.95\%}$
test_a2c_speed[reduce-overhead-None] 5.0326ms 4.3911ms 227.7346 Ops/s 232.3439 Ops/s $\color{#d91a1a}-1.98\%$
test_a2c_speed[reduce-overhead-backward] 12.7937ms 11.3517ms 88.0926 Ops/s 86.3411 Ops/s $\color{#35bf28}+2.03\%$
test_ppo_speed[False-None] 9.5639ms 7.8622ms 127.1911 Ops/s 125.2611 Ops/s $\color{#35bf28}+1.54\%$
test_ppo_speed[False-backward] 15.8718ms 15.4221ms 64.8422 Ops/s 63.8200 Ops/s $\color{#35bf28}+1.60\%$
test_ppo_speed[True-None] 4.4033ms 3.8061ms 262.7331 Ops/s 263.0286 Ops/s $\color{#d91a1a}-0.11\%$
test_ppo_speed[True-backward] 10.4881ms 10.1150ms 98.8631 Ops/s 101.4862 Ops/s $\color{#d91a1a}-2.58\%$
test_ppo_speed[reduce-overhead-None] 4.4060ms 3.7933ms 263.6214 Ops/s 262.4818 Ops/s $\color{#35bf28}+0.43\%$
test_ppo_speed[reduce-overhead-backward] 11.0959ms 10.1170ms 98.8436 Ops/s 102.3254 Ops/s $\color{#d91a1a}-3.40\%$
test_reinforce_speed[False-None] 8.2570ms 6.7791ms 147.5125 Ops/s 147.9937 Ops/s $\color{#d91a1a}-0.33\%$
test_reinforce_speed[False-backward] 10.8980ms 10.3378ms 96.7322 Ops/s 98.2875 Ops/s $\color{#d91a1a}-1.58\%$
test_reinforce_speed[True-None] 3.2630ms 2.7269ms 366.7207 Ops/s 366.5949 Ops/s $\color{#35bf28}+0.03\%$
test_reinforce_speed[True-backward] 10.0397ms 9.0151ms 110.9252 Ops/s 113.4910 Ops/s $\color{#d91a1a}-2.26\%$
test_reinforce_speed[reduce-overhead-None] 3.4114ms 2.7764ms 360.1729 Ops/s 365.1420 Ops/s $\color{#d91a1a}-1.36\%$
test_reinforce_speed[reduce-overhead-backward] 10.4739ms 9.1647ms 109.1145 Ops/s 110.8057 Ops/s $\color{#d91a1a}-1.53\%$
test_iql_speed[False-None] 36.1863ms 33.7453ms 29.6338 Ops/s 29.5834 Ops/s $\color{#35bf28}+0.17\%$
test_iql_speed[False-backward] 51.2638ms 47.3963ms 21.0987 Ops/s 21.1944 Ops/s $\color{#d91a1a}-0.45\%$
test_iql_speed[True-None] 12.0702ms 11.0777ms 90.2715 Ops/s 88.9650 Ops/s $\color{#35bf28}+1.47\%$
test_iql_speed[True-backward] 24.5355ms 22.5759ms 44.2950 Ops/s 44.0858 Ops/s $\color{#35bf28}+0.47\%$
test_iql_speed[reduce-overhead-None] 12.9974ms 11.2856ms 88.6087 Ops/s 89.4807 Ops/s $\color{#d91a1a}-0.97\%$
test_iql_speed[reduce-overhead-backward] 24.4933ms 22.6256ms 44.1978 Ops/s 44.4627 Ops/s $\color{#d91a1a}-0.60\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.7345ms 5.2667ms 189.8740 Ops/s 192.2039 Ops/s $\color{#d91a1a}-1.21\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8871ms 0.5430ms 1.8418 KOps/s 1.8500 KOps/s $\color{#d91a1a}-0.44\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7889ms 0.5121ms 1.9526 KOps/s 1.9675 KOps/s $\color{#d91a1a}-0.75\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.0313ms 5.1317ms 194.8666 Ops/s 198.6816 Ops/s $\color{#d91a1a}-1.92\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.3936s 0.8367ms 1.1951 KOps/s 1.9196 KOps/s $\textbf{\color{#d91a1a}-37.74\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8566ms 0.5080ms 1.9685 KOps/s 1.9893 KOps/s $\color{#d91a1a}-1.04\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.0524ms 1.6879ms 592.4505 Ops/s 592.9846 Ops/s $\color{#d91a1a}-0.09\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.2428ms 1.6061ms 622.6134 Ops/s 625.9164 Ops/s $\color{#d91a1a}-0.53\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.7610ms 5.2873ms 189.1307 Ops/s 197.1100 Ops/s $\color{#d91a1a}-4.05\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0111ms 0.6774ms 1.4763 KOps/s 1.5127 KOps/s $\color{#d91a1a}-2.41\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0407ms 0.6567ms 1.5229 KOps/s 1.5534 KOps/s $\color{#d91a1a}-1.97\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.7037ms 5.0362ms 198.5609 Ops/s 198.5444 Ops/s $+0.01\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.7931ms 0.5365ms 1.8639 KOps/s 1.8251 KOps/s $\color{#35bf28}+2.12\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7998ms 0.5141ms 1.9453 KOps/s 1.9855 KOps/s $\color{#d91a1a}-2.03\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.2710ms 4.9915ms 200.3402 Ops/s 204.1273 Ops/s $\color{#d91a1a}-1.86\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.6306ms 0.5304ms 1.8853 KOps/s 1.9223 KOps/s $\color{#d91a1a}-1.92\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7413ms 0.5035ms 1.9863 KOps/s 1.9731 KOps/s $\color{#35bf28}+0.67\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.7961ms 5.1792ms 193.0787 Ops/s 197.9383 Ops/s $\color{#d91a1a}-2.46\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.0530ms 0.6887ms 1.4520 KOps/s 1.5057 KOps/s $\color{#d91a1a}-3.57\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9040ms 0.6487ms 1.5416 KOps/s 1.5324 KOps/s $\color{#35bf28}+0.60\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.4620s 13.5159ms 73.9869 Ops/s 35.9944 Ops/s $\textbf{\color{#35bf28}+105.55\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.4272ms 2.4134ms 414.3499 Ops/s 442.1024 Ops/s $\textbf{\color{#d91a1a}-6.28\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.5261ms 1.3865ms 721.2245 Ops/s 691.7802 Ops/s $\color{#35bf28}+4.26\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 5.8460ms 4.3264ms 231.1382 Ops/s 221.6512 Ops/s $\color{#35bf28}+4.28\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 5.6556ms 2.3454ms 426.3650 Ops/s 386.2220 Ops/s $\textbf{\color{#35bf28}+10.39\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 8.0209ms 1.4295ms 699.5451 Ops/s 715.5779 Ops/s $\color{#d91a1a}-2.24\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.4753s 14.2393ms 70.2284 Ops/s 224.0258 Ops/s $\textbf{\color{#d91a1a}-68.65\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.1149ms 2.4585ms 406.7457 Ops/s 401.8012 Ops/s $\color{#35bf28}+1.23\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 1.9636ms 1.3699ms 729.9582 Ops/s 670.4072 Ops/s $\textbf{\color{#35bf28}+8.88\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 15.2660ms 13.8984ms 71.9510 Ops/s 71.2024 Ops/s $\color{#35bf28}+1.05\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 17.7022ms 15.4712ms 64.6361 Ops/s 65.8989 Ops/s $\color{#d91a1a}-1.92\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 27.9142ms 22.7103ms 44.0329 Ops/s 44.3030 Ops/s $\color{#d91a1a}-0.61\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 17.1407ms 15.5943ms 64.1260 Ops/s 65.2568 Ops/s $\color{#d91a1a}-1.73\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 24.9517ms 22.5146ms 44.4156 Ops/s 44.6949 Ops/s $\color{#d91a1a}-0.62\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 18.1156ms 16.8308ms 59.4148 Ops/s 59.3636 Ops/s $\color{#35bf28}+0.09\%$

Copy link

github-actions bot commented Dec 20, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}13$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7076s 0.7070s 1.4145 Ops/s 1.3691 Ops/s $\color{#35bf28}+3.31\%$
test_transformed 0.9747s 0.9635s 1.0379 Ops/s 1.0436 Ops/s $\color{#d91a1a}-0.55\%$
test_serial 2.2052s 2.1142s 0.4730 Ops/s 0.4695 Ops/s $\color{#35bf28}+0.75\%$
test_parallel 1.9156s 1.8539s 0.5394 Ops/s 0.5438 Ops/s $\color{#d91a1a}-0.80\%$
test_step_mdp_speed[True-True-True-True-True] 0.1360ms 41.0685μs 24.3496 KOps/s 25.1214 KOps/s $\color{#d91a1a}-3.07\%$
test_step_mdp_speed[True-True-True-True-False] 0.1211ms 23.5308μs 42.4975 KOps/s 41.8435 KOps/s $\color{#35bf28}+1.56\%$
test_step_mdp_speed[True-True-True-False-True] 56.9730μs 22.3606μs 44.7214 KOps/s 43.6594 KOps/s $\color{#35bf28}+2.43\%$
test_step_mdp_speed[True-True-True-False-False] 0.1247ms 13.1060μs 76.3007 KOps/s 76.4013 KOps/s $\color{#d91a1a}-0.13\%$
test_step_mdp_speed[True-True-False-True-True] 76.3640μs 42.8331μs 23.3464 KOps/s 23.4283 KOps/s $\color{#d91a1a}-0.35\%$
test_step_mdp_speed[True-True-False-True-False] 70.6440μs 25.7247μs 38.8731 KOps/s 37.6781 KOps/s $\color{#35bf28}+3.17\%$
test_step_mdp_speed[True-True-False-False-True] 0.2211ms 24.6049μs 40.6423 KOps/s 39.0196 KOps/s $\color{#35bf28}+4.16\%$
test_step_mdp_speed[True-True-False-False-False] 0.2020ms 15.3222μs 65.2650 KOps/s 64.0112 KOps/s $\color{#35bf28}+1.96\%$
test_step_mdp_speed[True-False-True-True-True] 0.2512ms 45.2816μs 22.0840 KOps/s 21.9160 KOps/s $\color{#35bf28}+0.77\%$
test_step_mdp_speed[True-False-True-True-False] 0.2243ms 28.0306μs 35.6753 KOps/s 35.0195 KOps/s $\color{#35bf28}+1.87\%$
test_step_mdp_speed[True-False-True-False-True] 82.6240μs 24.1398μs 41.4254 KOps/s 40.0003 KOps/s $\color{#35bf28}+3.56\%$
test_step_mdp_speed[True-False-True-False-False] 0.1204ms 15.3937μs 64.9615 KOps/s 63.8945 KOps/s $\color{#35bf28}+1.67\%$
test_step_mdp_speed[True-False-False-True-True] 83.2040μs 47.5916μs 21.0121 KOps/s 20.7852 KOps/s $\color{#35bf28}+1.09\%$
test_step_mdp_speed[True-False-False-True-False] 61.1930μs 30.3171μs 32.9847 KOps/s 32.3265 KOps/s $\color{#35bf28}+2.04\%$
test_step_mdp_speed[True-False-False-False-True] 0.1340ms 27.3879μs 36.5124 KOps/s 36.8988 KOps/s $\color{#d91a1a}-1.05\%$
test_step_mdp_speed[True-False-False-False-False] 76.6940μs 17.6514μs 56.6527 KOps/s 56.2125 KOps/s $\color{#35bf28}+0.78\%$
test_step_mdp_speed[False-True-True-True-True] 79.5640μs 45.6164μs 21.9219 KOps/s 22.0272 KOps/s $\color{#d91a1a}-0.48\%$
test_step_mdp_speed[False-True-True-True-False] 67.9230μs 28.4924μs 35.0971 KOps/s 35.1764 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[False-True-True-False-True] 65.2230μs 28.5200μs 35.0631 KOps/s 33.9729 KOps/s $\color{#35bf28}+3.21\%$
test_step_mdp_speed[False-True-True-False-False] 48.3720μs 17.0928μs 58.5041 KOps/s 57.0818 KOps/s $\color{#35bf28}+2.49\%$
test_step_mdp_speed[False-True-False-True-True] 0.1075ms 47.9219μs 20.8673 KOps/s 20.7952 KOps/s $\color{#35bf28}+0.35\%$
test_step_mdp_speed[False-True-False-True-False] 0.1067ms 30.1121μs 33.2092 KOps/s 32.7001 KOps/s $\color{#35bf28}+1.56\%$
test_step_mdp_speed[False-True-False-False-True] 3.1278ms 31.3798μs 31.8676 KOps/s 31.5042 KOps/s $\color{#35bf28}+1.15\%$
test_step_mdp_speed[False-True-False-False-False] 52.9630μs 19.5965μs 51.0294 KOps/s 50.7815 KOps/s $\color{#35bf28}+0.49\%$
test_step_mdp_speed[False-False-True-True-True] 92.8950μs 49.9605μs 20.0158 KOps/s 19.7468 KOps/s $\color{#35bf28}+1.36\%$
test_step_mdp_speed[False-False-True-True-False] 65.8940μs 32.9150μs 30.3813 KOps/s 30.0151 KOps/s $\color{#35bf28}+1.22\%$
test_step_mdp_speed[False-False-True-False-True] 92.3250μs 30.8556μs 32.4091 KOps/s 32.0836 KOps/s $\color{#35bf28}+1.01\%$
test_step_mdp_speed[False-False-True-False-False] 0.1979ms 19.5820μs 51.0672 KOps/s 50.8327 KOps/s $\color{#35bf28}+0.46\%$
test_step_mdp_speed[False-False-False-True-True] 0.2179ms 51.9945μs 19.2328 KOps/s 19.0168 KOps/s $\color{#35bf28}+1.14\%$
test_step_mdp_speed[False-False-False-True-False] 0.2162ms 34.9367μs 28.6232 KOps/s 28.7661 KOps/s $\color{#d91a1a}-0.50\%$
test_step_mdp_speed[False-False-False-False-True] 66.8140μs 32.3314μs 30.9297 KOps/s 30.3631 KOps/s $\color{#35bf28}+1.87\%$
test_step_mdp_speed[False-False-False-False-False] 67.9630μs 21.9476μs 45.5630 KOps/s 46.4274 KOps/s $\color{#d91a1a}-1.86\%$
test_values[generalized_advantage_estimate-True-True] 24.7495ms 24.2582ms 41.2231 Ops/s 42.1935 Ops/s $\color{#d91a1a}-2.30\%$
test_values[vec_generalized_advantage_estimate-True-True] 93.8567ms 2.7664ms 361.4750 Ops/s 349.4558 Ops/s $\color{#35bf28}+3.44\%$
test_values[td0_return_estimate-False-False] 0.1123ms 76.8506μs 13.0123 KOps/s 13.1639 KOps/s $\color{#d91a1a}-1.15\%$
test_values[td1_return_estimate-False-False] 54.6283ms 54.0964ms 18.4855 Ops/s 18.6995 Ops/s $\color{#d91a1a}-1.14\%$
test_values[vec_td1_return_estimate-False-False] 1.2615ms 1.0714ms 933.3866 Ops/s 937.3100 Ops/s $\color{#d91a1a}-0.42\%$
test_values[td_lambda_return_estimate-True-False] 86.4830ms 85.9182ms 11.6390 Ops/s 11.8037 Ops/s $\color{#d91a1a}-1.40\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.2221ms 1.0590ms 944.2613 Ops/s 938.8887 Ops/s $\color{#35bf28}+0.57\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.2240ms 23.8505ms 41.9278 Ops/s 42.1918 Ops/s $\color{#d91a1a}-0.63\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0177ms 0.7391ms 1.3529 KOps/s 1.3452 KOps/s $\color{#35bf28}+0.58\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.8018ms 0.6576ms 1.5206 KOps/s 1.5252 KOps/s $\color{#d91a1a}-0.30\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.7100ms 1.4678ms 681.3055 Ops/s 681.3695 Ops/s $-0.01\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.8324ms 0.6725ms 1.4870 KOps/s 1.4932 KOps/s $\color{#d91a1a}-0.41\%$
test_dqn_speed[False-None] 7.2297ms 1.5034ms 665.1759 Ops/s 661.0121 Ops/s $\color{#35bf28}+0.63\%$
test_dqn_speed[False-backward] 2.6217ms 2.0934ms 477.6887 Ops/s 474.5991 Ops/s $\color{#35bf28}+0.65\%$
test_dqn_speed[True-None] 0.7161ms 0.5501ms 1.8178 KOps/s 1.7988 KOps/s $\color{#35bf28}+1.06\%$
test_dqn_speed[True-backward] 1.3566ms 1.2036ms 830.8445 Ops/s 901.1282 Ops/s $\textbf{\color{#d91a1a}-7.80\%}$
test_dqn_speed[reduce-overhead-None] 0.7526ms 0.5705ms 1.7528 KOps/s 1.7652 KOps/s $\color{#d91a1a}-0.71\%$
test_dqn_speed[reduce-overhead-backward] 1.1180ms 0.9756ms 1.0250 KOps/s 1.0029 KOps/s $\color{#35bf28}+2.20\%$
test_ddpg_speed[False-None] 3.1870ms 2.8371ms 352.4736 Ops/s 349.5398 Ops/s $\color{#35bf28}+0.84\%$
test_ddpg_speed[False-backward] 4.4485ms 4.1017ms 243.8000 Ops/s 249.4854 Ops/s $\color{#d91a1a}-2.28\%$
test_ddpg_speed[True-None] 1.3970ms 1.0785ms 927.2327 Ops/s 916.2947 Ops/s $\color{#35bf28}+1.19\%$
test_ddpg_speed[True-backward] 2.4862ms 2.2989ms 434.9947 Ops/s 458.1055 Ops/s $\textbf{\color{#d91a1a}-5.04\%}$
test_ddpg_speed[reduce-overhead-None] 1.3564ms 1.1443ms 873.9288 Ops/s 906.7097 Ops/s $\color{#d91a1a}-3.62\%$
test_ddpg_speed[reduce-overhead-backward] 1.9773ms 1.7788ms 562.1769 Ops/s 603.2223 Ops/s $\textbf{\color{#d91a1a}-6.80\%}$
test_sac_speed[False-None] 8.3174ms 7.9003ms 126.5777 Ops/s 125.1086 Ops/s $\color{#35bf28}+1.17\%$
test_sac_speed[False-backward] 11.5927ms 10.9388ms 91.4181 Ops/s 92.5421 Ops/s $\color{#d91a1a}-1.21\%$
test_sac_speed[True-None] 1.9228ms 1.5695ms 637.1318 Ops/s 648.0193 Ops/s $\color{#d91a1a}-1.68\%$
test_sac_speed[True-backward] 3.9008ms 3.4059ms 293.6073 Ops/s 309.9183 Ops/s $\textbf{\color{#d91a1a}-5.26\%}$
test_sac_speed[reduce-overhead-None] 23.6497ms 12.6980ms 78.7527 Ops/s 79.9779 Ops/s $\color{#d91a1a}-1.53\%$
test_sac_speed[reduce-overhead-backward] 1.7059ms 1.5237ms 656.2857 Ops/s 737.0763 Ops/s $\textbf{\color{#d91a1a}-10.96\%}$
test_redq_speed[False-None] 8.3303ms 7.4106ms 134.9427 Ops/s 133.7967 Ops/s $\color{#35bf28}+0.86\%$
test_redq_speed[False-backward] 12.1631ms 11.3685ms 87.9626 Ops/s 89.8467 Ops/s $\color{#d91a1a}-2.10\%$
test_redq_speed[True-None] 2.1467ms 1.9649ms 508.9226 Ops/s 506.7267 Ops/s $\color{#35bf28}+0.43\%$
test_redq_speed[True-backward] 3.7627ms 3.5976ms 277.9640 Ops/s 260.9634 Ops/s $\textbf{\color{#35bf28}+6.51\%}$
test_redq_speed[reduce-overhead-None] 2.2865ms 1.9923ms 501.9251 Ops/s 501.6074 Ops/s $\color{#35bf28}+0.06\%$
test_redq_speed[reduce-overhead-backward] 4.2257ms 3.8090ms 262.5349 Ops/s 262.8900 Ops/s $\color{#d91a1a}-0.14\%$
test_redq_deprec_speed[False-None] 9.4065ms 8.9071ms 112.2695 Ops/s 110.1427 Ops/s $\color{#35bf28}+1.93\%$
test_redq_deprec_speed[False-backward] 12.8741ms 12.0494ms 82.9916 Ops/s 82.3722 Ops/s $\color{#35bf28}+0.75\%$
test_redq_deprec_speed[True-None] 2.7917ms 2.4143ms 414.1985 Ops/s 430.2473 Ops/s $\color{#d91a1a}-3.73\%$
test_redq_deprec_speed[True-backward] 4.2232ms 3.9518ms 253.0478 Ops/s 241.3236 Ops/s $\color{#35bf28}+4.86\%$
test_redq_deprec_speed[reduce-overhead-None] 2.6232ms 2.3105ms 432.8046 Ops/s 435.1944 Ops/s $\color{#d91a1a}-0.55\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.2168ms 4.0183ms 248.8624 Ops/s 240.2890 Ops/s $\color{#35bf28}+3.57\%$
test_td3_speed[False-None] 7.8411ms 7.7972ms 128.2509 Ops/s 127.8270 Ops/s $\color{#35bf28}+0.33\%$
test_td3_speed[False-backward] 10.5924ms 10.0222ms 99.7782 Ops/s 97.4588 Ops/s $\color{#35bf28}+2.38\%$
test_td3_speed[True-None] 1.6175ms 1.5847ms 631.0284 Ops/s 635.3911 Ops/s $\color{#d91a1a}-0.69\%$
test_td3_speed[True-backward] 3.3975ms 3.1633ms 316.1286 Ops/s 304.0590 Ops/s $\color{#35bf28}+3.97\%$
test_td3_speed[reduce-overhead-None] 81.8095ms 26.3748ms 37.9149 Ops/s 37.0963 Ops/s $\color{#35bf28}+2.21\%$
test_td3_speed[reduce-overhead-backward] 1.6355ms 1.4845ms 673.6094 Ops/s 670.8758 Ops/s $\color{#35bf28}+0.41\%$
test_cql_speed[False-None] 17.1538ms 16.5955ms 60.2574 Ops/s 59.7736 Ops/s $\color{#35bf28}+0.81\%$
test_cql_speed[False-backward] 22.3732ms 21.8199ms 45.8296 Ops/s 45.5622 Ops/s $\color{#35bf28}+0.59\%$
test_cql_speed[True-None] 3.1294ms 2.9354ms 340.6709 Ops/s 341.1279 Ops/s $\color{#d91a1a}-0.13\%$
test_cql_speed[True-backward] 5.8541ms 5.2806ms 189.3715 Ops/s 191.1387 Ops/s $\color{#d91a1a}-0.92\%$
test_cql_speed[reduce-overhead-None] 21.5159ms 13.2337ms 75.5644 Ops/s 75.8735 Ops/s $\color{#d91a1a}-0.41\%$
test_cql_speed[reduce-overhead-backward] 1.6522ms 1.5296ms 653.7681 Ops/s 650.9197 Ops/s $\color{#35bf28}+0.44\%$
test_a2c_speed[False-None] 3.6084ms 3.2665ms 306.1352 Ops/s 312.7943 Ops/s $\color{#d91a1a}-2.13\%$
test_a2c_speed[False-backward] 6.3256ms 5.8951ms 169.6322 Ops/s 168.0429 Ops/s $\color{#35bf28}+0.95\%$
test_a2c_speed[True-None] 1.1832ms 1.0155ms 984.7512 Ops/s 982.3425 Ops/s $\color{#35bf28}+0.25\%$
test_a2c_speed[True-backward] 2.8091ms 2.6190ms 381.8211 Ops/s 384.0675 Ops/s $\color{#d91a1a}-0.58\%$
test_a2c_speed[reduce-overhead-None] 21.7150ms 11.6758ms 85.6472 Ops/s 92.4664 Ops/s $\textbf{\color{#d91a1a}-7.37\%}$
test_a2c_speed[reduce-overhead-backward] 1.1413ms 1.0795ms 926.3350 Ops/s 1.0089 KOps/s $\textbf{\color{#d91a1a}-8.19\%}$
test_ppo_speed[False-None] 4.0011ms 3.6449ms 274.3568 Ops/s 272.1319 Ops/s $\color{#35bf28}+0.82\%$
test_ppo_speed[False-backward] 7.3399ms 6.8582ms 145.8117 Ops/s 151.5901 Ops/s $\color{#d91a1a}-3.81\%$
test_ppo_speed[True-None] 1.1687ms 0.9768ms 1.0237 KOps/s 1.0463 KOps/s $\color{#d91a1a}-2.16\%$
test_ppo_speed[True-backward] 2.9638ms 2.6992ms 370.4814 Ops/s 391.8451 Ops/s $\textbf{\color{#d91a1a}-5.45\%}$
test_ppo_speed[reduce-overhead-None] 0.6783ms 0.5266ms 1.8988 KOps/s 1.8543 KOps/s $\color{#35bf28}+2.40\%$
test_ppo_speed[reduce-overhead-backward] 1.2668ms 1.1250ms 888.8697 Ops/s 852.8448 Ops/s $\color{#35bf28}+4.22\%$
test_reinforce_speed[False-None] 2.5281ms 2.2426ms 445.9189 Ops/s 439.3099 Ops/s $\color{#35bf28}+1.50\%$
test_reinforce_speed[False-backward] 3.6850ms 3.3057ms 302.5091 Ops/s 299.8876 Ops/s $\color{#35bf28}+0.87\%$
test_reinforce_speed[True-None] 0.9978ms 0.8329ms 1.2006 KOps/s 1.1249 KOps/s $\textbf{\color{#35bf28}+6.73\%}$
test_reinforce_speed[True-backward] 3.0329ms 2.5472ms 392.5824 Ops/s 388.6665 Ops/s $\color{#35bf28}+1.01\%$
test_reinforce_speed[reduce-overhead-None] 23.2563ms 11.7312ms 85.2428 Ops/s 88.7599 Ops/s $\color{#d91a1a}-3.96\%$
test_reinforce_speed[reduce-overhead-backward] 1.2464ms 1.1824ms 845.7693 Ops/s 817.0831 Ops/s $\color{#35bf28}+3.51\%$
test_iql_speed[False-None] 9.9715ms 9.1892ms 108.8239 Ops/s 108.7117 Ops/s $\color{#35bf28}+0.10\%$
test_iql_speed[False-backward] 14.0497ms 12.9913ms 76.9744 Ops/s 77.0748 Ops/s $\color{#d91a1a}-0.13\%$
test_iql_speed[True-None] 2.0822ms 1.7793ms 562.0120 Ops/s 562.7325 Ops/s $\color{#d91a1a}-0.13\%$
test_iql_speed[True-backward] 4.3599ms 4.1940ms 238.4378 Ops/s 235.8283 Ops/s $\color{#35bf28}+1.11\%$
test_iql_speed[reduce-overhead-None] 19.6849ms 11.2961ms 88.5264 Ops/s 87.9834 Ops/s $\color{#35bf28}+0.62\%$
test_iql_speed[reduce-overhead-backward] 1.7508ms 1.6086ms 621.6632 Ops/s 699.6753 Ops/s $\textbf{\color{#d91a1a}-11.15\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 8.0262ms 6.4205ms 155.7503 Ops/s 153.1732 Ops/s $\color{#35bf28}+1.68\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.4806ms 0.2693ms 3.7129 KOps/s 3.1553 KOps/s $\textbf{\color{#35bf28}+17.67\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5428ms 0.2707ms 3.6947 KOps/s 3.6309 KOps/s $\color{#35bf28}+1.76\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.5873ms 6.1286ms 163.1684 Ops/s 159.0192 Ops/s $\color{#35bf28}+2.61\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.9218ms 0.3025ms 3.3053 KOps/s 3.8409 KOps/s $\textbf{\color{#d91a1a}-13.94\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5681ms 0.3187ms 3.1377 KOps/s 3.1998 KOps/s $\color{#d91a1a}-1.94\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7426ms 1.3431ms 744.5509 Ops/s 700.2987 Ops/s $\textbf{\color{#35bf28}+6.32\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.4999ms 1.2484ms 801.0199 Ops/s 754.8058 Ops/s $\textbf{\color{#35bf28}+6.12\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.6759ms 6.3051ms 158.6020 Ops/s 155.5137 Ops/s $\color{#35bf28}+1.99\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.0433ms 0.4604ms 2.1721 KOps/s 2.0909 KOps/s $\color{#35bf28}+3.88\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6278ms 0.4054ms 2.4666 KOps/s 2.3792 KOps/s $\color{#35bf28}+3.67\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.5069ms 6.1647ms 162.2140 Ops/s 160.1757 Ops/s $\color{#35bf28}+1.27\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9078ms 0.2762ms 3.6203 KOps/s 3.1125 KOps/s $\textbf{\color{#35bf28}+16.32\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6261ms 0.3364ms 2.9731 KOps/s 3.6044 KOps/s $\textbf{\color{#d91a1a}-17.51\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.5081ms 6.0841ms 164.3628 Ops/s 160.8770 Ops/s $\color{#35bf28}+2.17\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.8217ms 0.2793ms 3.5800 KOps/s 2.9024 KOps/s $\textbf{\color{#35bf28}+23.34\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5098ms 0.2836ms 3.5263 KOps/s 3.0446 KOps/s $\textbf{\color{#35bf28}+15.82\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.6519ms 6.3164ms 158.3191 Ops/s 156.3432 Ops/s $\color{#35bf28}+1.26\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.8918ms 0.5053ms 1.9789 KOps/s 2.1484 KOps/s $\textbf{\color{#d91a1a}-7.89\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6799ms 0.4554ms 2.1958 KOps/s 2.3261 KOps/s $\textbf{\color{#d91a1a}-5.60\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.1437ms 5.4374ms 183.9121 Ops/s 184.1218 Ops/s $\color{#d91a1a}-0.11\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.5759ms 2.0610ms 485.2044 Ops/s 442.1935 Ops/s $\textbf{\color{#35bf28}+9.73\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.1673ms 1.2116ms 825.3214 Ops/s 794.7298 Ops/s $\color{#35bf28}+3.85\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 7.2011ms 5.5046ms 181.6661 Ops/s 186.1583 Ops/s $\color{#d91a1a}-2.41\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 9.2447ms 2.0719ms 482.6404 Ops/s 434.2255 Ops/s $\textbf{\color{#35bf28}+11.15\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.9787ms 1.1596ms 862.3878 Ops/s 786.5648 Ops/s $\textbf{\color{#35bf28}+9.64\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.5144s 15.8302ms 63.1706 Ops/s 32.5167 Ops/s $\textbf{\color{#35bf28}+94.27\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.5572ms 2.3234ms 430.3999 Ops/s 441.3325 Ops/s $\color{#d91a1a}-2.48\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.2514ms 1.2847ms 778.3981 Ops/s 751.6665 Ops/s $\color{#35bf28}+3.56\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 16.0893ms 15.3372ms 65.2012 Ops/s 64.4781 Ops/s $\color{#35bf28}+1.12\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 20.0733ms 17.7915ms 56.2068 Ops/s 57.9428 Ops/s $\color{#d91a1a}-3.00\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 20.6941ms 19.8299ms 50.4289 Ops/s 48.6564 Ops/s $\color{#35bf28}+3.64\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 20.3123ms 17.7664ms 56.2861 Ops/s 57.0309 Ops/s $\color{#d91a1a}-1.31\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 19.8080ms 19.5908ms 51.0443 Ops/s 48.2893 Ops/s $\textbf{\color{#35bf28}+5.71\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 21.4398ms 19.3855ms 51.5850 Ops/s 53.3287 Ops/s $\color{#d91a1a}-3.27\%$

@vmoens vmoens added the CI Has to do with CI setup (e.g. wheels & builds, tests...) label Dec 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants